Meta-Reinforcement Learning Based Resource Allocation for Dynamic V2X Communications

نویسندگان

چکیده

This paper studies the allocation of shared resources between vehicle-to-infrastructure (V2I) and vehicle-to-vehicle (V2V) links in vehicle-to-everything (V2X) communications. In existing algorithms, dynamic vehicular environments quantization continuous power become bottlenecks for providing an effective timely resource policy. this paper, we develop two algorithms to deal with these difficulties. First, propose a deep reinforcement learning (DRL)-based algorithm improve performance both V2I V2V links. Specifically, uses Q-network (DQN) solve sub-band assignment deterministic policy-gradient (DDPG) problem. Second, meta-based DRL enhance fast adaptability policy environment. Numerical results demonstrate that proposed DRL-based can significantly compared DQN-based quantizes power. addition, achieve required adaptation new environment limited experiences.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Reinforcement Learning for Resource Allocation in V2V Communications

In this article, we develop a decentralized resource allocation mechanism for vehicle-to-vehicle (V2V) communication systems based on deep reinforcement learning. Each V2V link is considered as an agent, making its own decisions to find optimal sub-band and power level for transmission. Since the proposed method is decentralized, the global information is not required for each agent to make its...

متن کامل

A reinforcement learning approach to dynamic resource allocation

This paper presents a general framework for performing adaptive reconfiguration of a distributed system based on maximizing the long-term business value, defined as the discounted sum of all future rewards and penalties. The problem of dynamic resource allocation among multiple entities sharing a common set of resources is used as an example. A specific architecture (DRA-FRL) is presented, whic...

متن کامل

Online Resource Allocation Using Decompositional Reinforcement Learning

This paper considers a novel application domain for reinforcement learning: that of “autonomic computing,” i.e. selfmanaging computing systems. RL is applied to an online resource allocation task in a distributed multi-application computing environment with independent time-varying load in each application. The task is to allocate servers in real time so as to maximize the sum of performance-ba...

متن کامل

Context-Aware Dynamic Resource Allocation for Cellular M2M Communications

While the huge number of machine-to-machine (M2M) devices connects to the LTE system, the bursty random access attempts from them are potential to cause severe random access collisions and degrade the successful probability as well as delay of network connection establishment. To address this issue, current LTE spec has specified access class barring (ACB) and enhanced access barring (EAB), but...

متن کامل

Market-Based Dynamic Task Allocation Using Heuristically Accelerated Reinforcement Learning

This paper presents a Multi-Robot Task Allocation (MRTA) system, implemented on a RoboCup Small Size League team, where robots participate of auctions for the available roles, such as attacker or defender, and use Heuristically Accelerated Reinforcement Learning to evaluate their aptitude to perform these roles, given the situation of the team, in real-time. The performance of the task allocati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Vehicular Technology

سال: 2021

ISSN: ['0018-9545', '1939-9359']

DOI: https://doi.org/10.1109/tvt.2021.3098854